Korpus: plt_wikipedia_2016_100K

Weitere Korpora

3.6.2 Zipf's law for words of fixed lengths

Zipf distribution of words of fixed length 4, 6, 8, ..., 14.


Zipf's diagram for words of fixed length


Gnuplot diagram

Top Words of length 4
word rank frequency word
1 23496 2008
2 17372 Ilay
3 13428 2014
4 5365 2006
5 3190 1999
Top Words of length 6
word rank frequency word
1 20565 tanàna
2 16112 ben'ny
3 1429 teraka
4 1163 mizaka
5 1077 manome
Top Words of length 8
word rank frequency word
1 19420 tamin'ny
2 17628 mampiasa
3 17572 desimaly
4 17351 fanisana
5 3263 toradroa
Top Words of length 10
word rank frequency word
1 17339 mpikambana
2 4135 habakabaka
3 2503 Communauté
4 1839 Atsinanana
5 553 mpanoratra
Top Words of length 12
word rank frequency word
1 17301 isam-ponin'i
2 6125 manodidin'ny
3 4114 madinidinika
4 248 Madagasikara
5 212 horonantsary
Top Words of length 14
word rank frequency word
1 6116 hidodikodonany
2 85 Préardennaises
3 39 tendrombohitra
4 27 sambon-danitra
5 25 soson-drivotra
Slope for length 4
Slope
-1.188288478528256
Slope for length 6
Slope
-1.0912786506524563
Slope for length 8
Slope
-1.188288478528256
Slope for length 10
Slope
-0.9842414742769676
Slope for length 12
Slope
-0.8217263382430937
Slope for length 14
Slope
-0.5880456295278407
812 msec needed at 2018-01-12 13:11